CUBE File: A File Structure for Hierarchically Clustered OLAP Cubes

نویسندگان

  • Nikos Karayannidis
  • Timos K. Sellis
  • Yannis Kouvaras
چکیده

Hierarchical clustering has been proved an effective means for physically organizing large fact tables since it reduces significantly the I/O cost during ad hoc OLAP query evaluation. In this paper, we propose a novel multidimensional file structure for organizing the most detailed data of a cube, the CUBE File. The CUBE File achieves hierarchical clustering of the data, enabling fast access via hierarchical restrictions. Moreover, it imposes a low storage cost and adapts perfectly to the extensive sparseness of the data space achieving a high compression rate. Our results show that the CUBE File outperforms the most effective method proposed up to now for hierarchically clustering the cube, resulting in 7-9 times less I/Os on average for all workloads tested. Thus, it achieves a higher degree of hierarchical clustering. Moreover, the CUBE File imposes a 2-3 times lower storage cost.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Dynamic Grid File for High-Dimensional Data Cube Storage and Range-Sum Querying

In this article, the authors propose to use the grid file to store multi-dimensional data cubes and answer rangesum queries. The grid file is enhanced with a dynamic splitting mechanism to accommodate insertions of data. It overcomes the drawback of the traditional grid file in storing uneven data while enjoying its advantages of simplicity and efficiency. The space requirement grows linearly w...

متن کامل

Initial explorations into the user experience of 3D file browsing

We present an initial exploration into the usability of 3D file browsing. To explore the 3D file browsing technique design space, we analyzed the existing literature and developed three representative 3D file browsing techniques that cover many of their characteristics. Block3D uses a priority weighting scheme to elevate and display files in a grid-based structure. Cluster3D uses sets of animat...

متن کامل

SISYPHUS: The implementation of a chunk-based storage manager for OLAP data cubes

In this article, we present the design and implementation of SISYPHUS, a storage manager for data cubes that provides an efficient physical base for performing OLAP operations. On-Line Analytical Processing (OLAP) poses new requirements to the physical storage layer of a database management system. Special characteristics of OLAP cubes such as multidimensionality, hierarchical structure of dime...

متن کامل

C-CUBE: Un nouvel opérateur d'agrégation pour les entrepôts de données en colonnes

RÉSUMÉ. Les bases de données orientées colonnes offrent au domaine décisionnel le modèle le plus approprié au stockage des entrepôts de données. Cependant, en l’absence d’opérateurs d’analyse en ligne, le seul moyen, très coûteux, qui existe pour construire des cubes OLAP consiste à utiliser l’opérateur UNION sur des requêtes de regroupement afin d’obtenir l’ensemble des Group By nécessaires au...

متن کامل

Cubes of Concepts: Multi-dimensional Exploration of Multi-valued Contexts

A number of information systems offer a limited exploration in that users can only navigate from one object to another object, e.g. navigating from folder to folder in file systems, or from page to page on the Web. An advantage of conceptual information systems is to provide navigation from concept to concept, and therefore from set of objects to set of objects. The main contribution of this pa...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004